Self-Organizing Map for Characterizing Heterogeneous Nucleotide and Amino Acid Sequence Motifs
Abstract
A self-organizing map (SOM) is an artificial neural network algorithm that learns from training data consisting of objects expressed as vectors and performs non-hierarchical clustering, mapping input vectors to discrete clusters so that vectors assigned to the same cluster share similar numeric or alphanumeric features. SOM has been widely used in transcriptomics to identify co-expressed genes as candidates for co-regulated genes. I envision that SOM has great potential for characterizing heterogeneous sequence motifs, and I aim to illustrate this potential through a parallel presentation of SOM applied to a set of numerical vectors and to a set of equal-length sequence motifs. While there are numerous biological applications of SOM involving numerical vectors, few studies have used SOM for heterogeneous sequence motif characterization. This paper is intended to encourage (1) researchers to study SOM in this new domain and (2) computer programmers to develop user-friendly motif-characterization SOM tools for biologists.
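To make the clustering idea in the abstract concrete, here is a minimal sketch of a SOM trained on equal-length nucleotide motifs. It is not the paper's implementation. The one-hot encoding, the 2x2 grid, the linear decay of learning rate and neighborhood width, and every name (`one_hot`, `best_matching_unit`, `train_som`) are assumptions chosen for illustration, and the motifs are made-up toy data.

```python
import math
import random

ALPHABET = "ACGT"  # assumed nucleotide alphabet; an amino acid alphabet would work the same way


def one_hot(motif):
    """Encode a motif as a flat 0/1 vector (assumed encoding, 4 values per position)."""
    vec = []
    for ch in motif:
        vec.extend(1.0 if ch == a else 0.0 for a in ALPHABET)
    return vec


def sq_dist(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def best_matching_unit(weights, x):
    """Return the grid coordinate of the node whose weight vector is closest to x."""
    return min(weights, key=lambda node: sq_dist(weights[node], x))


def train_som(data, rows=2, cols=2, epochs=200, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small rectangular SOM; returns {(row, col): weight_vector}."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                 # learning rate decays linearly
        sigma = sigma0 * (1.0 - frac) + 1e-3    # neighborhood width shrinks, never zero
        for x in rng.sample(data, len(data)):   # visit inputs in shuffled order
            bmu = best_matching_unit(weights, x)
            for node, w in weights.items():
                grid_d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-grid_d2 / (2.0 * sigma ** 2))  # Gaussian neighborhood
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])           # pull node toward the input
    return weights


# Toy, made-up motifs of equal length (not data from the paper).
motifs = ["AAAAAA", "AAAAAT", "AAATAA", "GGGGGG", "GGGGGC", "GGCGGG"]
data = [one_hot(m) for m in motifs]
weights = train_som(data)
clusters = {m: best_matching_unit(weights, one_hot(m)) for m in motifs}
for motif, node in clusters.items():
    print(motif, "->", node)
```

Each motif is assigned to the grid node with the nearest weight vector, so motifs that land on the same node form one cluster. Reading a trained node's weights four values at a time gives a rough per-position base preference for that cluster, which is one plausible way to summarize a heterogeneous motif set.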
Related articles
Mining Biological Data Using Self-Organizing Map
This paper presents a novel method of mining biological data using a self-organizing map (SOM). After partitioning a set of protein sequences using SOM, conventional homology alignment is applied to each cluster to determine the conserved local motif (biological pattern) for the cluster. These local motifs are then regarded as rules for prediction and classification. In the application to the p...
Nucleotide sequence of cDNA encoding for preprochymosin in native goat (Capra hircus) from Iran
Prochymosin is one of the most important aspartic proteinases used as a milk-clotting enzyme in cheese production. In the present investigation we report the sequence of the cDNA encoding goat (Capra hircus) preprochymosin and compare its nucleotide and deduced amino acid sequences with those of preprochymosin from other ruminants. Like bovine prochymosin, the caprine prochymosin cDNA encodes 365 amino ...
A Large-scale Batch-learning Self-organizing Map for Function Prediction of Poorly-characterized Proteins Progressively Accumulating in Sequence Databases: Annual Report of the Earth Simulator Center April 2007 - March 2008
As a result of the decoding of extensive genome sequences, a large number of proteins whose function cannot be predicted by homology search of amino acid sequences are progressively accumulating and thus remain of no use in science and industry. A method for predicting protein function that does not depend on sequence homology search is urgently needed. We previously developed a Batch-Learnin...
Nucleotide mutation analyses of isolated lentogenic Newcastle disease virus in live bird market
Newcastle Disease (ND) is a major viral disease in Indonesia. Its causative agent is an RNA virus belonging to Paramyxovirinae. It is well known that RNA viruses mutate easily. In some cases, such mutations can alter virulence. It has been noted that an NDV with an avirulent amino acid sequence at the cleavage site could mutate into a virulent Newcastle Disease Virus (NDV). It is needed to...
Nucleotide and Amino Acid Changes in HN, F and SH genes of an Iranian Mumps Virus; RS-12, Following Attenuation to Vaccine Strain
Background and Aims: The wild-type RS-12 strain of mumps virus was isolated from an Iranian patient and attenuated through several serial passages. This study was designed to determine nucleotide and amino acid substitutions in the HN, F and SH genes during attenuation of the wild-type virus. Materials and Methods: The required viral samples were prepared at Razi Vaccine and Serum Institute. Vi...
Journal: Computation
Volume: 5
Publication year: 2017